
    Genome Trees from Conservation Profiles

    The concept of the genome tree depends on the potential evolutionary significance in the clustering of species according to similarities in the gene content of their genomes. In this respect, genome trees have often been identified with species trees. With the rapid expansion of genome sequence data, it becomes increasingly important to develop accurate methods for capturing global trends in the phylogenetic signals that mutually link the various genomes. We therefore derive here the methodological concept of genome trees based on protein conservation profiles in multiple species. The basic idea in this derivation is that the multi-component “presence-absence” protein conservation profiles permit tracking of common evolutionary histories of genes across multiple genomes. We show that a significant reduction in informational redundancy is achieved by considering only the subset of distinct conservation profiles. Beyond these basic ideas, we point out various pitfalls and limitations associated with the data handling, paving the way for further improvements. As an illustration of the methods, we analyze a genome tree based on the above principles, along with a series of other trees derived from the same data and based on pair-wise comparisons (ancestral duplication-conservation and shared orthologs). In all trees we observe a sharp discrimination between the three primary domains of life: Bacteria, Archaea, and Eukarya. The new genome tree, based on conservation profiles, displays a significant correspondence with classically recognized taxonomical groupings, along with a series of departures from such conventional clusterings.
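The "presence-absence" conservation profiles and the reduction to distinct profiles described in this abstract can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the function names (`conservation_profiles`, `distinct_profiles`, `profile_distances`), the toy input format, and the simple mismatch count between genomes are not taken from the paper, whose actual tree-building method is not given in the abstract.

```python
from collections import Counter
from itertools import combinations


def conservation_profiles(family_presence, genomes):
    """Map each protein family to a 0/1 tuple, one entry per genome.

    family_presence: dict of family name -> set of genomes containing it
    (a hypothetical input format chosen for this sketch).
    """
    return {
        family: tuple(1 if genome in present else 0 for genome in genomes)
        for family, present in family_presence.items()
    }


def distinct_profiles(profiles):
    """Collapse identical profiles, counting how many families share each."""
    return Counter(profiles.values())


def profile_distances(distinct, genomes):
    """Count, for each genome pair, the distinct profiles in which the two
    genomes differ. This is an assumed stand-in distance, not the paper's."""
    distances = {}
    for (i, g1), (j, g2) in combinations(enumerate(genomes), 2):
        distances[(g1, g2)] = sum(1 for p in distinct if p[i] != p[j])
    return distances


if __name__ == "__main__":
    genomes = ["A", "B", "C"]
    families = {
        "fam1": {"A", "B"},
        "fam2": {"A", "B"},      # same profile as fam1, so redundant
        "fam3": {"A", "B", "C"},
        "fam4": {"C"},
    }
    distinct = distinct_profiles(conservation_profiles(families, genomes))
    print(len(families), "families ->", len(distinct), "distinct profiles")
    print(profile_distances(distinct, genomes))
```

Counting each distinct profile once, rather than once per family, is what removes the redundancy mentioned in the abstract; the resulting pairwise distances could then be passed to any standard tree-building routine.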

    Join forces or cheat: evolutionary analysis of a consumer-resource system

    In this contribution we consider a seasonal consumer-resource system and focus on the evolution of consumer behavior. It is assumed that consumer and resource individuals live and interact during seasons of fixed lengths separated by winter periods. All individuals die at the end of the season, and the size of the next generation is determined by the consumer-resource interaction which took place during the season. Resource individuals are assumed to reproduce at a constant rate, while consumers have to trade off foraging for resources, which increases their reproductive abilities, against reproducing. Firstly, we assume that consumers cooperate in such a way that they maximize each consumer's individual fitness. Secondly, we consider the case where such a population is challenged by selfish mutants who do not cooperate. Finally, we study the system dynamics over many seasons and show that mutants eventually replace the original cooperating population, but are finally as vulnerable as the initial cooperating consumers.

    Considering scores between unrelated proteins in the search database improves profile comparison

    Background: Profile-based comparison of multiple sequence alignments is a powerful methodology for the detection of remote protein sequence similarity, which is essential for the inference and analysis of protein structure, function, and evolution. Accurate estimation of statistical significance of detected profile similarities is essential for further development of this methodology. Here we analyze a novel approach to estimate the statistical significance of profile similarity: the explicit consideration of background score distributions for each database template (subject). Results: Using a simple scheme to combine and analytically approximate query- and subject-based distributions, we show that (i) inclusion of background distributions for the subjects increases the quality of homology detection; (ii) this increase is higher when the distributions are based on the scores to all known non-homologs of the subject rather than a small calibration subset of the database representatives; and (iii) these all-known-non-homolog distributions of scores for the subject make the dominant contribution to the improved performance: adding the calibration distribution of the query has a negligible additional effect. Conclusion: The construction of distributions based on the complete sets of non-homologs for each subject is particularly relevant in the setting of structure prediction, where the database consists of proteins with solved 3D structure (PDB, SCOP, CATH, etc.) and therefore structural relationships between proteins are known. These results point to a potential new direction in the development of more powerful methods for remote homology detection.

    MemBrain: Improving the Accuracy of Predicting Transmembrane Helices

    Prediction of transmembrane helices (TMH) in α-helical membrane proteins provides valuable information about the protein topology when high-resolution structures are not available. Many predictors have been developed based on either amino acid hydrophobicity scales or purely statistical approaches. While these predictors perform reasonably well in identifying the number of TMHs in a protein, they are generally inaccurate in predicting the ends of TMHs, or TMHs of unusual length. To improve the accuracy of TMH detection, we developed a machine-learning-based predictor, MemBrain, which integrates a number of modern bioinformatics approaches including sequence representation by multiple sequence alignment matrix, the optimized evidence-theoretic K-nearest neighbor prediction algorithm, fusion of multiple prediction window sizes, and classification by dynamic threshold. MemBrain demonstrates an overall improvement of about 20% in prediction accuracy, particularly in predicting the ends of TMHs and TMHs that are shorter than 15 residues. It also has the capability to detect N-terminal signal peptides. The MemBrain predictor is a useful sequence-based analysis tool for functional and structural characterization of helical membrane proteins; it is freely available at http://chou.med.harvard.edu/bioinf/MemBrain/

    FastBLAST: Homology Relationships for Millions of Proteins

    Background: All-versus-all BLAST, which searches for homologous pairs of sequences in a database of proteins, is used to identify potential orthologs, to find new protein families, and to provide rapid access to these homology relationships. As DNA sequencing accelerates and data sets grow, all-versus-all BLAST has become computationally demanding. Methodology/principal findings: We present FastBLAST, a heuristic replacement for all-versus-all BLAST that relies on alignments of proteins to known families, obtained from tools such as PSI-BLAST and HMMer. FastBLAST avoids most of the work of all-versus-all BLAST by taking advantage of these alignments and by clustering similar sequences. FastBLAST runs in two stages: the first stage identifies additional families and aligns them, and the second stage quickly identifies the homologs of a query sequence, based on the alignments of the families, before generating pairwise alignments. On 6.53 million proteins from the non-redundant Genbank database ("NR"), FastBLAST identifies new families 25 times faster than all-versus-all BLAST. Once the first stage is completed, FastBLAST identifies homologs for the average query in less than 5 seconds (8.6 times faster than BLAST) and gives nearly identical results. For hits above 70 bits, FastBLAST identifies 98% of the top 3,250 hits per query. Conclusions/significance: FastBLAST enables research groups that do not have supercomputers to analyze large protein sequence data sets. FastBLAST is open source software and is available at http://microbesonline.org/fastblast

    Alpha-tocotrienol is the most abundant tocotrienol isomer circulated in plasma and lipoproteins after postprandial tocotrienol-rich vitamin E supplementation

    Background: Tocotrienols (T3) and tocopherols (T), both members of the natural vitamin E family, have unique biological functions in humans. T3 are detected in circulating human plasma and lipoproteins, although at concentrations significantly lower than α-tocopherol (α-T). T3, especially α-T3, is known to be neuroprotective at nanomolar concentrations, and this study evaluated the postprandial fate of T3 and α-T in plasma and lipoproteins. Methods: Ten healthy volunteers (5 males and 5 females) were administered a single dose of vitamin E [526 mg palm tocotrienol-rich fraction (TRF) or 537 mg α-T] after 7-d pre-conditioning on a T3-free diet. Blood was sampled at baseline (fasted) and 2, 4, 5, 6, 8, and 24 h after supplementation. Concentrations of T and T3 isomers in plasma, triacylglycerol-rich particles (TRP), LDL, and HDL were measured at each postprandial interval. Results: After TRF supplementation, plasma α-T3 and γ-T3 peaked at 5 h (α-T3: 4.74 ± 1.69 μM; γ-T3: 2.73 ± 1.27 μM). δ-T3 peaked earlier at 4 h (0.53 ± 0.25 μM). In contrast, α-T peaked at 6 h (30.13 ± 2.91 μM) and 8 h (37.80 ± 3.59 μM) following supplementation with TRF and α-T, respectively. α-T was the major vitamin E isomer detected in plasma, TRP, LDL, and HDL even after supplementation with TRF (composed of 70% T3). No T3 were detected during fasted states. T3 were detected postprandially only after TRF supplementation, and concentrations were significantly lower than α-T. Conclusions: Bio-discrimination between vitamin E isomers in humans reduces the rate of T3 absorption and affects their incorporation into lipoproteins. Although low absorption of T3 into circulation may impact some of their physiological functions in humans, T3 have biological functions at concentrations well below those noted in this study.

    First Neutrino Observations from the Sudbury Neutrino Observatory

    The first neutrino observations from the Sudbury Neutrino Observatory are presented from preliminary analyses. Based on energy, direction and location, the data in the region of interest appear to be dominated by 8B solar neutrinos, detected by the charged current reaction on deuterium and elastic scattering from electrons, with very little background. Measurements of radioactive backgrounds indicate that the measurement of all active neutrino types via the neutral current reaction on deuterium will be possible with small systematic uncertainties. Quantitative results for the fluxes observed with these reactions will be provided when further calibrations have been completed. Comment: LaTeX, 7 pages, 10 figures, invited paper at Neutrino 2000 Conference, Sudbury, Canada, June 16-21, 2000, to be published in the Proceedings.

    CMB Telescopes and Optical Systems

    The cosmic microwave background radiation (CMB) is now firmly established as a fundamental and essential probe of the geometry, constituents, and birth of the Universe. The CMB is a potent observable because it can be measured with precision and accuracy. Just as importantly, theoretical models of the Universe can predict the characteristics of the CMB to high accuracy, and those predictions can be directly compared to observations. There are multiple aspects associated with making a precise measurement. In this review, we focus on optical components for the instrumentation used to measure the CMB polarization and temperature anisotropy. We begin with an overview of general considerations for CMB observations and discuss common concepts used in the community. We next consider a variety of alternatives available for a designer of a CMB telescope. Our discussion is guided by the ground and balloon-based instruments that have been implemented over the years. In the same vein, we compare the arc-minute resolution Atacama Cosmology Telescope (ACT) and the South Pole Telescope (SPT). CMB interferometers are presented briefly. We conclude with a comparison of the four CMB satellites, Relikt, COBE, WMAP, and Planck, to demonstrate a remarkable evolution in design, sensitivity, resolution, and complexity over the past thirty years. Comment: To appear in: Planets, Stars and Stellar Systems (PSSS), Volume 1: Telescopes and Instrumentation.

    Testing statistical significance scores of sequence comparison methods with structure similarity

    BACKGROUND: In recent years the Smith-Waterman sequence comparison algorithm has gained popularity due to improved implementations and rapidly increasing computing power. However, the quality and sensitivity of a database search is determined not only by the algorithm but also by the statistical significance testing for an alignment. The e-value is the most commonly used statistical validation method for sequence database searching. The CluSTr database and the Protein World database have been created using an alternative statistical significance test: a Z-score based on Monte-Carlo statistics. Several papers have described the superiority of the Z-score as compared to the e-value, using simulated data. We were interested in whether this could be validated when applied to existing, evolutionarily related protein sequences. RESULTS: All experiments are performed on the ASTRAL SCOP database. The Smith-Waterman sequence comparison algorithm with both e-value and Z-score statistics is evaluated, using ROC, CVE and AP measures. The BLAST and FASTA algorithms are used as reference. We find that two out of three Smith-Waterman implementations with e-value are better at predicting structural similarities between proteins than the Smith-Waterman implementation with Z-score. SSEARCH especially has very high scores. CONCLUSION: The compute-intensive Z-score does not have a clear advantage over the e-value. The Smith-Waterman implementations give generally better results than their heuristic counterparts. We recommend using the SSEARCH algorithm combined with e-values for pairwise sequence comparisons.
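The idea of a Monte-Carlo Z-score for an alignment can be sketched in Python as follows. This is only an illustration under stated assumptions, not the procedure used by CluSTr, Protein World, or the paper: the scoring scheme (match 2, mismatch -1, linear gap -2) is hypothetical and replaces the substitution matrices and affine gap penalties that real tools use, and the names `smith_waterman_score` and `monte_carlo_z_score` are invented for this sketch. The observed local alignment score is compared with the scores obtained against shuffled copies of the subject sequence.

```python
import math
import random
import statistics


def smith_waterman_score(a, b, match=2, mismatch=-1, gap=-2):
    """Best local alignment score with a simple linear gap penalty.

    The scoring parameters are assumptions made for this sketch.
    """
    prev = [0] * (len(b) + 1)
    best = 0
    for ca in a:
        curr = [0]
        for j, cb in enumerate(b, start=1):
            diag = prev[j - 1] + (match if ca == cb else mismatch)
            score = max(0, diag, prev[j] + gap, curr[j - 1] + gap)
            curr.append(score)
            best = max(best, score)
        prev = curr
    return best


def monte_carlo_z_score(query, subject, n_shuffles=100, seed=0):
    """Z = (observed - mean of shuffled scores) / std of shuffled scores."""
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    observed = smith_waterman_score(query, subject)
    letters = list(subject)
    background = []
    for _ in range(n_shuffles):
        rng.shuffle(letters)  # preserves composition, destroys order
        background.append(smith_waterman_score(query, "".join(letters)))
    mean = statistics.mean(background)
    sd = statistics.pstdev(background)
    if sd == 0:
        # Degenerate background: no spread to normalise by.
        if observed == mean:
            return 0.0
        return math.copysign(math.inf, observed - mean)
    return (observed - mean) / sd


if __name__ == "__main__":
    seq = "MKTAYIAKQRQISFVKSHFSRQ"
    print(monte_carlo_z_score(seq, seq))
```

Each Z-score requires `n_shuffles` additional alignments per sequence pair, which is why the abstract calls the Z-score compute-intensive compared with an analytically derived e-value.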